9 Robust Factor Analysis: Methods and Applications

نویسنده

  • Peter Filzmoser
چکیده

The word robustness is frequently used in the literature, and is often stated with completely di®erent meaning. In this contribution robustness means to reduce the in°uence of \unusual" observations on statistical estimates. Such observations are frequently denoted as outliers, and are often thought to be extreme values caused by measurement or transcription errors. However, the notion of outliers also includes observations (or groups of observations) which are inconsistent with the remaining data set. The judgment whether observations are declared as outliers or as \inliers" is sometimes subjective, and robust statistics should serve as a tool for an objective decision. In terms of a statistical model, robust statistics could be de¯ned as follows \In a broad informal sense, robust statistics is a body of knowledge, partly formalized intòtheory of robustness', relating to deviations from idealized assumptions in statistics. statistics is aimed at yielding reliable results in cases where classical assumptions like normality, independence or linearity are violated. Real data sets almost always include outliers. Sometimes they are harmless and do not change the results if they are included in the analysis or deleted beforehand. However, they can have a major in°uence on the results, and completely alter the statistical estimates. Deleting such observations before analyzing the data would be a way out, but this implies that the outliers can indeed be identi¯ed, which is not trivial, especially in higher dimensions (see Section 9.2.3). Another way to reduce the in°uence of outliers is to ¯t the majority of the data, which is assumed to be the \good" part of data points. The majority ¯t is done by introducing a weight function for downweighting outliers, with weights equal to 0 and 1 or taken at the continuous scale. This process where outliers are not excluded beforehand but downweighted in the analysis is called a robust procedure. The outliers can be identi¯ed afterwards by looking at the values of the weight function or by inspecting the residuals which are large in the robust analysis. In either case, outliers should not simply be deleted or ignored. Rather, an important task is to ask what has caused these outliers. They have to be analyzed and interpreted because they often contain very important information on data quality or unexpected behavior of some observations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Development, Factor Analysis, and Validation of an EFL Teacher Change Scale (TCS)

The concept of teacher change is critical in second language teaching and English as a Foreign Language (EFL) context due largely to the fact that, almost, whatever we do in teacher education looks for initiating change of one sort or another. A substantial body of research has been dedicated to investigate teacher change (TC) from various perspectives.  However, having studied the related lite...

متن کامل

The Use of Robust Factor Analysis of Compositional Geochemical Data for the Recognition of the Target Area in Khusf 1:100000 Sheet, South Khorasan, Iran

The closed nature of geochemical data has been proven in many studies. Compositional data have special properties that mean that standard statistical methods cannot be used to analyse them. These data imply a particular geometry called Aitchison geometry in the simplex space. For analysis, the dataset must first be opened by the various transformations provided. One of the most popular of the a...

متن کامل

A TRUST-REGION SEQUENTIAL QUADRATIC PROGRAMMING WITH NEW SIMPLE FILTER AS AN EFFICIENT AND ROBUST FIRST-ORDER RELIABILITY METHOD

The real-world applications addressing the nonlinear functions of multiple variables could be implicitly assessed through structural reliability analysis. This study establishes an efficient algorithm for resolving highly nonlinear structural reliability problems. To this end, first a numerical nonlinear optimization algorithm with a new simple filter is defined to locate and estimate the most ...

متن کامل

Design of robust carrier tracking systems in high dynamic and high noise conditions, with emphasis on neuro-fuzzy controller

The robust carrier tracking is defined as the ability of a receiver to determine the phase and frequency of the input carrier signal in unusual conditions such as signal loss, input signal fading, high receiver dynamic, or other destructive effects of propagation. An implementation of tight tracking can be understood in terms of adopting a very narrow loop bandwidth that contradict with the req...

متن کامل

Simultaneous robust estimation of multi-response surfaces in the presence of outliers

A robust approach should be considered when estimating regression coefficients in multi-response problems. Many models are derived from the least squares method. Because the presence of outlier data is unavoidable in most real cases and because the least squares method is sensitive to these types of points, robust regression approaches appear to be a more reliable and suitable method for addres...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001